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(54) Content addressable memory 

(57) The implementation of two-dimensional decod- 
ing, necessary to achieve a reasonable array aspect ra- 
tio for a large content addressable memory, is achieved 
by having multiple match lines per physical row, these 
match lines being physically routed on top of the array 
core cell in an upper metal layer. To limit power dissipa- 



tion in the resulting large-capacity content addressable 
memory, the match function is implemented by two or 
more NAND chains per word. Means for achieving the 
precharging and evaluation of these chains, and for im- 
plementing dummy chains for the provision of timing in- 
formation, are also disclosed. 
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Description 

Technical Field 

The present invention relates to" content addressa- 
ble memory (CAM). 

Background information 

CAMs are memories in which data is selected 
based on its contents, rather than its physical location. 
This function is useful for many applications, especially 
when performing a look-up for the purposes of mapping 
from a long identification word to a shorter word. This 
operation is required in many telecom functions, includ- 
ing Asynchronous Transfer Mode (ATM) address trans- 
lation. 

United States Patent No. 5,289,403 entitled "Self- 
Timed Content Addressable Memory Access Mecha- 
nism with Built-in Margin Test Feature" granted to J.D. 
Yetter on February 22, 1994 describes a means of pro- 
viding self-timing to a NOR-match line CAM, using a 
dummy column and dummy row. The bit at the intersec- 
tion of the dummy column and dummy row always miss- 
es, while all other cells in that row always match, and 
this generates the slowest possible mismatch condition 
on the dummy match line. This signal is then used to 
generate a clock for timing of subsequent events, and 
for qualifying ail other match lines. 

Unhed States Patent No. 5,453,948 entitled "Asso- 
ciative Memory 0 granted to M. Yoneda on September 
26, 1995 describes a low-power implementation of a 
match line, in which the match line is realized as a series 
chain, instead of a NOR. 

United States Patent No. 5,455784 entitled "Asso- 
ciative Memory Device with Small Memory Cells Selec- 
tively Storing Data Bits and Donl Care Bits" granted to 
H. Yamada on October 3, 1995 describes a CAM with 
individual storage cells comprised of a series combina- 
tion of two EEPROM devices. The series connection of 
these cells (one per bit) to form a word of CAM results 
in a series match line chain. Shifted transistor threshold 
voltages, resulting from programming, cause the EEP- 
ROM devices to act as either enhancement- or deple- 
tion-mode transistors, and hence "don't cares" may be 
stored by making both devices depletion-mode. 

United States Patent No. 5,469,378 entitled "Con- 
tent Addressable Memory Having Match Line Transis- 
tors Connected in Series and Coupled to Current Sens- 
ing Circuit" granted to R. Albon et al on November 21 , 
1995 describes a series match line chain. 

Most previous CAMs employ a dynamic wired-NOR 
match line pull-down, which provides high speed. How- 
ever, the prior art circuit construction causes match lines 
associated with mismatched words to experience tran- 55 
sitions, while no transitions occur on match lines asso- 
ciated with matched words. Since the number of match- 
es is much smaller than the number of mismatches, the 



number of transitions, and hence the power dissipation, 
is excessive for a wired-NOR match line CAM. 

Summary Of The Invention 

5 

The present invention intends to implement CAMs 
of sufficient capacities for ATM address translation table 
applications, for example. The inadequacy of tech- 
s niques known as prior art is due to both architectural 
io and circuit limitations. Two of the design problems that 
are addressed by the present invention are (i) the im- 
plementation of two-dimensional decoding, necessary 
to achieve a reasonable array aspect ratio for a large 
memory; and (ii) achieving low power dissipation, de- 
15 spite the simultaneous searching of the entire contents 
of the large CAM. 

To achieve implementation of two-dimensional de- 
coding, there must be multiple words in a single physical 
row. This can be achieved by either (a) sharing match 
20 lines between multiple words, or (b) having multiple 
match lines per physical row. In this invention, the sec- 
ond approach is chosen. 

According to the most general aspect of the present 
invention, there is provided a content addressable mem- 
25 ory (CAM) comprising: w words of matching means, 
each word comprising i segmented match line chains, 
each match line chain comprising b/i core cells which 
are chained by a match line, each* core cell comprising 
means for storing data, w, i and b being integers; logic 
30 means for logically coupling the match line chains in a 
respective row; and encoding means for providing 
search results based on the outputs from the logic 
means. 

For example, the logic means comprises NAND cir- 
35 cuit means. In a case of the NAND circuit being dynamic 
NAND circuit, power dissipation decreases. The CAM 
may further comprise means for sensing the transitions 
of data signals on the chains. In the CAM, the cell arrays 
are segment pairs which are oriented in mirror images, 
to such that their outputs face each other, facilitating the 
logic combination of the two partial match results from 
the two cell arrays, by the logic means. 

The CAM may employ a dummy word, but it does 
not employ a dummy column. As well, the dummy words 
^5 model a match, rather than a mismatch. The words in 
the CAM are divided into a plurality of segments, with a 
match line chain per segment. The CAM uses purely 
voltage sensing means. 

50 Brief Description Of The Drawings 

The present invention will be further understood 
from the following description with reference to the ac- 
companying drawings, in which: 



Figure 1 illustrates a prior art CAM; 
Figure 2 illustrates a CAM according to an embod- 
iment of the present invention; 
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Figure 3 illustrates a logical equivalence of a NOR 
and NOT-AND gates; 

Figure 4 illustrates a logical segmentation of the 
NOT-AND gates; 

Figure 5A illustrates a match line circuit composed s 
of two segments; 

Figure 5B captures conceptually the two segment 
circuit shown in Figure 5A; 

Figure 6A illustrates a single word of storage with 
its match line; w 
Figure 6B illustrates four words sharing an encoder; 
Figure 7A illustrates a single physical row with two 
encoders; 

Figure 7B illustrates an array with a plurality of the 
rows illustrated in Figure 7 A; is 
. Figure 8A illustrates multiple words of four seg- 
ments each; 

. - Figure BB illustrates multiple words of eight seg- 
1S . ments each; 

, Figure 9 is a transistor-level schematic diagram of 
a memory core cell; 

Figure 1 0 is a view of memory core cell upper metal 
layers; 

.. Figure 11 illustrates a first example of a modified 
core cell; 

Figure 1 2 illustrates a second example of a modified 
core cell; 

Figure 13 illustrates a third example of a modified 
core cell; 

Figure 14 illustrates a dummy chain used to detect 30 
the completion of precharge; 
Figure 15 illustrates a simple implementation of a 
precharge sense circuit; 

Figure 16A illustrates a pull-down chain circuit with 
in-cycle precharge; 35 
Figure 16B illustrates signal timing of the pre- 
charge; 

Figure 17A illustrates a pull-up chain circuit with in- 
cycle. precharge; 

Figure 17B illustrates signal timing of the pre- 40 
charge; 

Figure 18A illustrates a pull-down chain circuit with 
quiescent precharge; 

Figure -18B illustrates signal timing of the pre- 
charge; 45 
Figure 1 9A illustrates a pull-up chain circuit with qui- 
escent precharge; 

Figure 19B illustrates signal timing of the pre- 
charge; 

Figure 20A illustrates a pull-down chain circuit with so 
quiescent precharge from both ends of the chain; 
Figure 20B illustrates signal timing of the pre- 
charge; 

Figure 2 1 A illustrates a pull-up chain circuit with qui- 
escent precharge from both ends of the chain; ss 
Figure 21 B illustrates signal timing of the pre- 
charge; 

Figure 22A illustrates a pull-down chain circuit with 



quiescent precharge and a single clock-controlled 
quiescent-off device; 

Figure 22B illustrates signal timing of the pre- 
charge; 

Figure 23A illustrates a pull-up chain circuit with qui- 
escent precharge and a single clock-controlled qui- 
escent-off device; 

Figure 23B illustrates signal timing of the pre- 
charge; 

Figure 24A illustrates a pull-down chain circuit with 
quiescent precharge and a single data-controlled 
quiescent-off device; 

Figure 24B illustrates signal timing of the pre- 
charge; 

Figure 25A illustrates a pull-up chain circuit with qui- 
escent precharge and a single data -controlled qui- 
escent-off device; 

Figure 25B illustrates signal timing of the pre- 
charge; 

Figure 26A illustrates a pull-down chain circuit with 
quiescent precharge and intentional charge shar- 
ing; 

Figure 26B illustrates signal timing of the pre- 
charge; 

Figure 27A illustrates a pull-up chain circuit with qui- 
escent precharge and intentional charge sharing; 
Figure 27B illustrates signal timing of the pre- 
charge; and 

Figure 28 illustrates an architecture in which col- 
umns are organized in a word-sliced fashion. 

Detailed Description 

The majority of prior art CAMs employ wired-NOR 
match lines, as shown in Figure 1 . Referring to Figure 
1 , a single chip CAM of w (=4) words x b (=4) bits is 
implemented as a CAM with w rows and b columns. The . 
memory array includes w x b (=16) memory core cells 
110, each being at the intersection of a match line 112 
and a pair of bit lines 114. A pair of bit lines 114 carry 
differential data representing a single bit, rather than two 
bits of data. Each core cell 110 acts to store a single bit 
of data and is capable of performing a single-bit com- 
parison (logical exclusive NOR (XNOR)) operation, in 
addition to its bit storage capability. In Figure 1 , the cells 
110 belonging to a given word are connected to the 
match line 112 of that word in a logical NOR fashion. 

The bit lines 114 for differential data are connected 
to reference word storage and bit line drivers 116 which 
receive input data D for loading the contents of the CAM 
and for the search reference word. Data stored in the 
array's core cells 110 are searched by applying a refer- 
ence word on the bit lines 114. 

When differential data is asserted on a pair of bit 
lines 114 in a search operation, the core cell 110 com- 
pares its stored data bit with the differential data (also 
known as reference data, or a single bit of the com- 
parand). When the stored data is not equal to the refer- 
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ence data, the core cell 110 pulls the match line 112 
(which is precharged to a logical high state) down to a 
low state. When the stored data is equal to the reference 
data, the cell 110 has no effect on the match line 112 to 
which it is connected. Because all b core cells 110 in a 
given word are connected to the match line 112 in the 
same way, the match line 112 will be pulled low if any 
bit in its word is unequal to (or mismatches) the corre- 
sponding reference bit. The match line 112 remains in 
a logical high state only if all bits in its word are equal to 
the reference data. 

The CAM includes an encoder 118 which produces 
three outputs representing the result of the search op- 
eration. A hit signal hit will be asserted to a logical high 
state, if any of the w words is storing data which has 
matched the reference data. The binary address of this 
matching word is encoded and an address signal sa is 
provided. In the event that a plurality of words have 
matched the reference data, a multi-match signal mult 
is asserted to a logical high state. In this event, the ad- 
dress sa output of the encoder 118 may produce (a) an 
invalid result, (b) an address representing the location 
of a single one of the multiple matches, or (c) a se- 
quence of outputs, representing the locations of each of 
the matched words. Note that some applications may 
not require the "mult" result. 

An alternative CAM of a large-capacity having time- 
shared comparators and multiple match lines running 
over each core cell is disclosed in United States Patent 
Application No. OB/748,928 entitled "Large-Capacity 
Content Addressable Memory", filed on November 14, 
1 996 by K.J. Schultz et al, which is incorporated herein 
by reference. The CAM employs a dynamic wired-NOR 
match line pull-down and includes single-bit compara- 
tors which are not located in the core cells, but are in- 
stead time-shared among multiple words. The circuit 
construction causes match lines connected to mis- 
matched words to experience transitions, while 
matched words cause no transitions. 

Because it is expected that only one, or at the most 
a very few match will occur, the number of transitions 
and the resulting power dissipation are excessive for a 
NOR match line. 

Figure 2 shows a CAM according to an embodiment 
of the present invention. Referring to Figure 2, there are 
w (=4) words shown, each occupying a row, and each 
having b (=8) bits and b (=8) core cells 210. The words 
are divided into two halves, and the results of the match 
on each half word are combined. Each of the two halves 
is provided with an array of 4 rows x 4 columns. The 
array includes 16 core cells 210, each being at the in- 
tersection of a match line 212 and a pair of bit lines 214 
which carry differential data representing a single bit. 
The bit lines 214 for differential data are connected to 
reference word storage and bit line drivers 216 which 
receive input data D for loading the contents of the CAM 
and lor the search reference word. Data stored in the 
array's core cells 210 are searched by applying a refer- 



ence word on the bit lines 214. 

Each core cell 210 acts to store a single bit of data 
and is capable of performing a single-bit comparison 
(logical exclusive NOR (XNOR)) operation, in addition 
5 to its bit storage capability. In Figure 2, the cells 210 be- 
longing to a given word are connected to the match line 
212 of that word in a logical NAND fashion. The core 
cells 210 of each word are chained in the respective 
match line 212. One end of each chain is connected to 
io an inverter 218. The other end of the chain is connected 
to a terminal of logic 0. The outputs of the inverters 218 
are connected to an AND gate 220, the output of which 
is referred to as a "word match line" 224, and which is 
connected to an encoder 222. 
'5 in Figure 2, the connection (in each half word) is in 
a logical NAND. The match line 212 will only have a 
downward transition, if all of the bits in the half word are 
equal to the reference data. Hence, the path to ground 
for the match line 212 is serial (a "match line chain") rath- 
20 er than parallel, and the path is made conductive (i.e., 
the circuit is closed) in the event of a match, rather than 
a mismatch. 

The advantage of this technique is due to the much 
smaller number of match lines 212 subject to a transition 
25 in each search operation: one per match in the embod- 
iment shown in Figure 2, compared to one per mismatch 
in the prior art circuit shown in Figure 1. This reduces 
power dissipation considerably, allowing the realization 
of larger storage capacities. The division of the word into 
30 halves decreases the length of the NAND chain, thus 
increasing speed. 

The embodiment of a CAM shown in Figure 2 also 
includes means of placing multiple words in a physical 
row, by employing an upper metal layer above the core 
35 cell for multiple word match lines 224. This further in- 
creases the storage capacity that can be realized. 

The CAM produces three output signals hit, sa and 
mult that represent the result of the search operation, 
and these may all be generated by the encoder 222. The 
40 hit signal hit is asserted to a logical high state if any of 
the w words is storing data which has matched the ref- 
erence data. The binary address of this matching word 
is encoded and the address signal sa is provided. In the 
event that a plurality of words have matched the refer- 
45 ence data, a multi-match signal mult is asserted to a log- 
ical high state. In this event, the address sa output of 
the encoder 222 may produce (a) an invalid result, (b) 
an address representing the location of a single one of 
the multiple matches, or (c) a sequence of outputs, rep- 
50 resenting the locations of each of the matched words. 

The CAM shown in Figure 2 employs a lower-power 
circuit of equivalent function to the NOR match line. A 
logical equivalent of a NOR is a NOT- AND, as shown in 
Figure 3. Thus, instead of checking whether any bit mis- 
55 matches, and producing a logical ml = match, jt may be 
checked whether all bits match (which, if true, will result 
in a pull-down), yielding the logical mln = match. This 
choice of Boolean implementation results in far fewer 
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transitions (because transitions only occur in the case 
of matches), and therefore, lower power dissipation. Un- 
fortunately, it is obvious that a dynamic NAND will be 
slower than a dynamic NOR. This is especially true for 
the wide words (b >16). On the one hand, delay caused 
by such a long pull-down chain becomes tolerable in ad- 
vanced CMOS technologies, especially in relation to a 
-slow" (<100 MHz) application. On the other hand, it 
would be beneficial to take some steps to reduce the 
delay. This is easily done by dividing the word into a 
number i 3 2 segments, with one series chain per seg- 
ment, combining the results as shown in Figure 4. This 
satisfies the concurrent design goals of high speed and 
wide words. 

It is clear that, when each first-stage NAND is im- 
plemented as a dynamic gate, it results in more transi- 
tions and higher power due to matches of partial words. 
Hence, there results a continuous power-speed trade- 
off: the more segments into which the word is divided, 
the higher the speed and power, with the logical extreme 
in both speed and power being the NOR. At the other 
extreme of low speed and low power is the single NAND 
chain, which is equally undesirable. Because the sim- 
plest segmentation to implement is a division into i = 2 
chains, this will serve as the example embodiments 
shown Figures 5A, 6A and 7A. 

Figure 5A shows a match line circuit composed of 
two segments (two chains) which are used in the CAM 
shown in Figure 2. Figure 5B shows conceptually first 
arid second segments (chains) 522 and 524. Referring 
to Figure 5A, N-channel FETs of the match line chains 
are shown explicitly with core cells. Each of the match 
line chains is connected to the inverter 218 and each 
output of the two inverters 21 8 is connected to the AND 
gate 220. With the word divided in two, the second chain 
524 is oriented as the mirror image of the first chain 522, 
and they are brought together, such that their outputs 
face each other. In Figure 5A, the chain is physically im- 
plemented as a pull-down chaia with a GND (or VSS) 
connection used at one end opposite the output of the 
chain. Alternatively, the chain may physically be imple- 
mented as a pull-up chain, with a power (or VDD) con- 
nection used at one end opposite the output of the chain 
and with the inverters logically removed. 

The word match line must be routed to an encoder, 
where the actual physical address information is derived 
from the match information, as described above, and 
shown in Figure 2. This routing is most easily done 
above the core cells, in the highest level of metal avail- 
able. This is done in Figure 6A for a single word, assum- 
ing the .encoder (not shown) is to the right. One may, by 
extension, run multiple match lines over each core cell, 
and place multiple columns of words adjacent to the 
same encoder, as shown in Figure 6B. A capacity that 
is a multiple of four words may be achieved by stacking 
a plurality of these units in the vertical dimension. 

If the number of word match lines over each core 
cell (actually, the number of word match lines over the 
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core cells closest to the encoder) is m, there will be m 
words adjacent to each other, with no intervening en- 
coder. As shown in Figures 7 A and 7B, one is not limited 
to a single encoder. There are 2*m words associated 
5 with each encoder, per row. 

If the number of encoders is e ; there will be 2*m*e 
words per row. II the number of rows is r, the array will 
have a capacity of 2*m*e*r words. The outputs of the e 
encoders may be combined at the physical bottom or 
io top of the array shown in Figure 7B. 

If, instead of dividing a word into exactly i = 2 seg- 
ments, it is divided into i > 2 segments, the output of the 
circuit such as that shown in Figures 5A and B repre- 
sents a partial word result, rather than a true word match 
75 line. These partial word results may be combined imme- 
diately adjacent to the encoder, as shown in Figures 8A 
and 8B. 

The parameters are summarized as follows: 

20 • The number of word match lines running above 
each core cell is m 

• The number of word match lines plus partial word 
result lines running above each core cell is i *m/2 
(we assume here that i is even, but point out that it 

25 could be odd). 

• The number of word match lines per physical row, 
also equal to the number of words per physical row, 
is 2*m*e. 

• The number of inputs to each encoder per row is 
30 2*m. 

Figure 9 shows an example of the memory core cell 
210 of the CAM shown in Figure 2. The core cell shown 
in Figure 9 is a transistor-level core cell. The cell is com- 

35 prised of two cross-coupled inverters between the two 
static storage nodes, and two access FETs 710 and 712 
gated by a word line wl, joining the storage nodes c and 
cn to a pair of bit lines bl and bin. This is a known ar- 
rangement for a static SRAM core cell (inverters of P- 

40 and N-channel FETs). Other three N-channel FETs 721 , 
723 and 725 form the comparison portion of the cell. The 
gate, drain and source of the FET 721 are connected to 
the negative storage node cn, a negative compare bit 
line kn and the gate of the FET 723, respectively. The 

45 gate, drain and source of the FET 725 are connected to 
the positive storage node c, a positive comparison bit 
line k and the gate of the FET 723, respectively. The 
FET 723 itself forms a part of the match line chain and 
constitutes the device shown explicitly in Figures 5A and 

50 8A, with its source and drain connected, as appropriate 
to its location in the chain, to either similar FET in neigh- 
bouring cells, or to the circuitry comprising either end of 
the chain. 

The compare bit lines k/kn are separate physical 
55 wires from the normal bit line pair bl/bln, running parallel 
to them in the vertical direction. Employing separate 
lines decreases the capacitive load on each, hence de- 
creasing power dissipation and increasing speed. It also 
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allows the independent setting ot the quiescent states 
at the storage access devices, and the comparison de- 
vices; bl/bln may be kept high during a search, in prep- 
aration tor the next read or write, and k/kn may be kept 
high or low during a read or write, in preparation for the 
next search. 

The function achieved by the cell is the combination 
ot (a) the storage of a bit ot data, and (b) the switching 
of a device in the match line chain: on in the case of the 
compare bit matching the stored bit, and off in the case 
of the compare bit mismatching the stored bit. In other 
words, if the binary value stored in the cell at node c, 
matches the comparand on line k, the FET 723 is con- 
ducting. To mask a given bit from the search, one must 
ensure that it always matches, and that the chain the 
FET 723 is always conducting; this is done by asserting 
both kand kn high. 

Assuming i = 2 and m = 4, and ignoring the power 
rails VDD/VSS, the global signal upper-layer metal view 
ot the core cell is shown in Figure 10. AH layers from 
second metal and up are shown. Referring to Figure 1 0, 
the layer of the third metals is overlying the layer of the 
second metal and underneath the layer of the fourth and 
fifth metals. The second metal is the word line metal ( wl). 
The third metals are the bit line metals (bl and bin) and 
comparison bit line metals (k and kn). The fourth and 
fifth metals are the match line metals. The number of 
horizontal signals in the top layer of metal would be the 
same for i = 4 and m = 2, or i = 8 and m = 1 . 

The core cell may be modified, at the transistor lev- 
el, in three different ways, to yield a cell that always 
matches; that is, it occupies the same area as a real 
core cell, and uses transistors of the same size, but the 
result of the search is guaranteed to be the conduction 
of the chain device. These three modified core cells are 
useful in the implementation of a dummy (or model) 
word or chain, and are shown in Figures 11, 12, and 13. 
The cell shown in Figure 11 has its chain device contin- 
ually on — there is never a discharge path for the gate 
of the chain device. The cell shown in Figure 12 (which 
is referred to as "conditionally on") matches when either 
or both of k/kn are high; when both are low, the chain 
device is off. Two additional devices (not shown) may 
be included in the cell shown in Figure 12 for the purpose 
of providing model load on the word line. In Figure 13, 
node cn is continually pulled up by the diode-connected 
P-channel FET 731 , and is also pulled up to VDD when- 
ever the word line wl is asserted (the VDD connection 
is not strictly necessary, and may be replaced with a 
floating drain, depending on which option presents the 
more convenient layout topology). As a result, node c is 
held low. The combination of cn high at the gate of a 
FET 741 , and VDD applied at the source of the FET 741 , 
which is connected to a FET 743, results a FET 745 con- 
ducting continually, as desired. 

A dummy chain may be used for two purposes. The 
first purpose is to determine when sufficient time has 
been allotted for precharge of the chain. A chain that 



w 



15 



20 



25 



30 



35 



40 



45 



50 



55 



maybe used for this purpose is shown in Figure 14. The 
chain is made up of a number of consecutive "always 
on" cells 810, such as that shown in Figure 11, a single 
"conditionally on" cell 812, such as the one shown in 
Figure 12, and a precharge sense circuit 814, such as 
the one shown in Figure 1 5. The precharge sense circuit 
includes a FET 821 and an inverter. A precharge signal 
is fed to the gate of the FET 821 and a precharge done 
signal is provided from the output of the inverter 823. 
The "conditionally on" cell 812 is included to reflect the 
condition. that precharge cannot be completed until the 
k/kn signals have been asserted to their valid search 
voltages. To more properly reflect the propagation delay 
through the chain as a result of k/kn assertion, the entire 
dummy chain may be made up of "conditionally on" cells 
(this implementation is not shown). The precharge sens- 
ing dummy chain is not required in all implementations, 
as is described below. 

The second purpose of a dummy chain is as a seg- 
ment of a dummy word, to determine the time interval 
for successful evaluation of a match. A word that always 
matches is used to generate a timing signal which can 
be used to clock the evaluation of all other match sig- 
nals. This word may be constructed with the chain as 
shown in Figure 14, or with all "conditionally on" cells, 
or with the cells shown in Figure 1 3. Note that the pre- 
charge sense circuit is not required. The timing signal 
that is generated can further be used to clock the en- 
coder, or as part of a self-timing path for the CAM as a 
whole. The match evaluation timing dummy word can 
be used with all implementations. 

Note that the CAM architecture results in inherently 
deterministic search delays. All match lines begin in a 
mismatch state, and all have equivalent delay to transi- 
tion to a match state. Hence, in modelling a match, one 
is guaranteed to be modelling the slowest condition. In 
the prior art case of a NOR match line, all match lines 
begin in a match state, and the speed of the transition 
to a mismatch state depends on the number of bits mis- 
matching. The important timing condition of validity of 
the match state on the match line must be inferred by 
observing the slowest possible mismatch. 

There are many possible ways to implement a 
match line chain, in terms of polarity selection, and pre- 
charge and evaluation timing and control. The following 
descriptions cover a variety of the possible implemen- 
tations, but are not meant to limit the scope of the inven- 
tion. A person reasonably skilled in the art could devise 
other similar techniques. 

Figures 16Aand 16B show a pull-down chain circuit 
and signal timing of the precharge, respectively. Figures 
1 7A and 1 7B show a pull-up chain circuit and signal tim- 
ing of the precharge, respectively. Each of the chain cir- 
cuits shown in Figures 1 6A and 17A is used in conjunc- 
tion with a precharge pulse that occurs after the begin- 
ning of the cycle. Precharge pulses are fed to the gates 
of FETs 831 and 833 which are connected to the chains. 
To avoid charge-sharing problems following the pre- 
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charge, precharge must overlap the assertion of valid 
data on k/kn. A dummy chain for precharge sensing is 
required to determine when precharge has completed, 
and to initiate timing of the remainder of the search op- 
eration. In the case of a match, there is a very resistive 
power-GND (or VDD-VSS) path through the entire chain 
toward the end of the precharge operation. 

If the precharge begins at the end of the clock cycle 
(i.e., if the quiescent state of the signals applied to the 
chain leaves it in the precharged state), there is no need 
to sense the completion of precharge before initiating 
the remainder of the search operation. In this case, pre- 
charge completion simply represents a minimum re- 
quirement for the cycle time of the CAM. This observa- 
tion applies to all of the remaining chains described in 
this disclosure. 

The chains shown in Figures 1 8A and 1 9A are iden- 
tical to those of Figures 16A and 17A, except for pre- 
■charge timing. Note that k/kn are also now in a quiescent 
high state,.allowing complete precharging of all interme- 
diate nodes in the chain. Figure 18A shows this chain 
designed to implement a pull-down, and Figure 19A 
shows the same concept with a pull-up. Figures 1 8B and 
19B show precharge timing in the chains shown in Fig- 
ures 18A and 19A, respectively. 

Because precharging from one end of the chain on- 
ly may be too slow for some applications, precharge de- 
vices may be placed at both ends of the chain. This re- 
quires the addition of a third device to switch off the eval- 
uation path to the opposite supply during precharge, in 
order to avoid power-GND current, which would be sig- 
nificant otherwise. Timing is the same as in the case of 
the chains of Figures 18A and 19A. Figure 20A shows 
this chain designed to implement a pull-down, and Fig- 
ure 21 A shows the same concept with a pull-up. The 
chain shown in Figure 20Ahas series -connected P-and 
N-channel FETs 841 and 843 in a power-GND (or VDD- 
VSS) path, the junction of the FETs being connected to 
the end of the chain. Similarly, the chain shown in Figure 
21 A has series-connected P- and N-channel FETs 851 
and 853 in a power-GND (or VDD-VSS) path, the junc- 
tion of the FETs being connected to the end of the chain. 
Due to FET gating signal selection, neither of these 
VDD-VSS paths are conductive, except during pre/pre 
transition. Figures 20B and 21 B show precharge timing 
in the chains shown in Figures 20A and 21 A, respec- 
tively. 

Precharging from a single end of the chain may be 
fast enough, but the circuits of Figure's ISAand 19Amay 
be undesirable due to the resistive power-GND current 
path in the quiescent precharge state. The chains 
shown in Figures 22A and 23A solve this problem by 
including a transistor to prevent evaluation until pre- 
charge has completed. Figure 22A shows this chain de- 
signed to implement a pull-down, and Figure 23A shows 
the same concept with a pull-up. The chain shown in 
Figure 22A has an N-channel FET 861 connected be- 
tween the end of the chain and the ground. The chain 



shown in Figure 23A has a P-channel FET 863 connect- 
ed between the end of the chain and a power VDD ter- 
minal. Precharge pulses are fed to the gates of the FETs 
861 and 863. Figures 22B and 23B show precharge tim- 
5 • ing in the chains shown in Figures 22A and 23A, respec- 
tively. 

It may be desirable to, at the same time, limit the 
number of clocked devices in the chain (to limit clocking 
power), and still remove the possibility of a resistive 
io power-GND short during precharge. It may be prefera- 
ble to precharge from one end only, as in Figures 18A, 
19A, 22A and 23A, but it is preferable to use a data- 
controlled device to prevent short-circuit current, in- 
stead of the clock-controlled device of Figures 22A and 
5 23A. Hence, there are provided circuits that drive the k/ 
kn signals (referred to as "k-drrvers") such that the k- 
drivers corresponding to the column furthest from the 
precharge device have a quiescent low/low state, rather 
than high/high, as is the case in the other columns, and 
o as was the case in Figures 18A, 19A, 20A, 21 A, 22A 
and 23A. The chain design is shown in Figures 24A and 
25A. with the two different types of k signals conceptu- 
ally shown as a single line through the chain device, la- 
belled either kL or kH. Note that the core cell employed. 
? is still that of Figure 9, and the k line over the chain de- 
vice is merely conceptual. Figure 24A shows this chain 
designed to implement a pull-down, and Figure 25A 
shows the same concept with a pull-up. Figures 24B and 
25B show precharge timing in the chains shown in Fig- 
30 ures 24A and 25A, respectively. 

With all previous chain designs, it is a design goal 
to totally eliminate the possibility of charge sharing. If 
the chain length is not required to be variable, and is 
instead fixed, some charge sharing could be tolerable, 
35 and may be intentionally designed into the chain. Pre- 
charge delay and power can be decreased slightly by 
leaving a small number of chain nodes unprecharged. 
This technique is shown in Figures 26A and 27 A. The 
kL column is moved away from the end of the chain op- 
40 posite the precharge device toward the middle of the 
chain. Columns to the left of the kL column may be either 
kL or kH driven. In the statistically unlikely event of a 
match of all of the bits in the kH columns, but a mismatch' 
in one or more of the other columns, the charge (or lack 
45 of charge) on the unprecharged nodes will cause the 
voltage at the inverter gate to move to an intermediate 
value which must not be evaluated as a match. If this 
situation is understood and accounted for, it should be 
tolerable. Figure 26A shows this chain designed to im- 
50 plement a pull-down, and Figure 27A shows the same 
concept with a pull-up. Figures 26B and 27B show pre- 
charge timing in the chains shown in Figures 26A and 
27A, respectively. 

In all example designs shown, represented is the 
55 match sense circuit as a simple inverter. In fact, this cir- 
cuit may be implemented as any static or dynamic volt- 
age sense device. 

Because some target applications are likely to have 
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a high degree of commonality between matches and 
mismatches (i.e., they may differ in only a few bits, and 
have long strings of consecutive bits in common), situ- 
ations may arise in which excessive power is dissipated 
in the precharge of these "near-misses". This is espe- 5 
daily true if word segments (entire chains) match in mis- 
matched words. Note that this situation has no counter- 
part in traditional wired-NOR match line CAMs. 

One example of the above situation (not intended 
to limit applicability) is in ATM address look-up. The ad- io 
dress is composed of two fields: Virtual Channel Identi- 
fier (VCI) and Virtual Path Identifier (VPI). Many entries 
may (a) have identical VPIs, and differ in a lew bits of 
the VCI, or (b) have identical VClsi and differ in a few 
bits of the VPI. In order to limit power dissipation in this 15 
event, it is beneficial to scramble the order of the bits in 
the chain. 

In most memories, columns are organized in a bit- 
sliced fashion, such that all columns associated with a 
given data bit are grouped together. The architecture de- 
scribed in this disclosure is word-sliced, as shown in Fig- 
ure 28, instead of bit-sliced. Because of this, a global 
data bus 910, joining all words (that is, joining all col- 
umns associated with each of the bits) is required. Some 
observations about this bus: 

• The bus 91 0 may be used for search, read or write 
operations, supplying bidirectional data. 
All drivers onto the bus 91 0 must be tri-statable. 

Although particular embodiment of the present in- 
vention have been described in detail, it should be ap- 
preciated that numerous variations, modifications, and 
adaptations may be made without departing from the 
scope of the present invention as defined in the claims. 35 
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Claims 

1 . A content addressable memory (CAM) comprising: 40 

w words of matching means, each word com- 
prising i segmented match line chains, each 
match line chain comprising b/i core cells which 
are chained by a match line, each core cell 45 
comprising means for storing data, w, i and b 
being integers; 

logic means for logically coupling the match line 
chains in a respective row; and 
encoding means for providing search results 50 
based on the outputs from the logic means. 



11. 



sensing the transitions of data signals on the 
chains. 

The CAM of claim 3, wherein the polarity of data 
signal swing to signify a match is up or down. 

The CAM of claim t , wherein i=2 and a pair of match 
line chains are oriented in mirror images, such that 
their outputs face each other, facilitating the logic 
combination of the two partial match results from 
the two chains, by a logic means. 

The CAM of claim 5, wherein the logic means com- 
prises AND logic means and the word match lines, 
resulting from the ANDing of partial match results 
on the match chains, one word match line per word, 
are routed so as to convey the match result to the 
encoding means. 

The CAM of claim 6, wherein the word match lines 
are selectively routed physically over the core cells 
in a metal layer above all other signals used in the 
core cell. 

The CAM of claim 6, wherein the encoding means 
comprises a plurality of encoders. 

The CAM of claim 6, wherein multiple words are 
placed adjacent to the encoding means, with multi- 
ple word match lines running over each core cell, 
allowing multiple word match lines to be routed per 
row. 

The CAM of claim 9, wherein the word match lines 
are selectively routed over the core cells in a metal 
layer above all other signals used in the core cell. 

The CAM of claim 6, wherein the partial match re- 
sults are selectively routed above the core cell, to 
be combined in the AND logic means. 



2. 



The CAM of claim 1, wherein the logic means com- 
prises NAND circuit means which provides logical 
outputs in response to the outputs from the match 55 
line chains in a respective row. 



3. The CAM of claim 1, further comprising means for 



12. The CAM of claim 1 1 , wherein the results from pairs 
of match line segments are first combined by a log- 
ical AND operation into a secondary partial result, 
the result then being routed above the core cell, to 
be combined in a final AND gate adjacent to the en- 
coder. 

13. The CAM of claim 1 , wherein the core cell compris- 
es a plurality of field effect transistors (FETs) for da- 
ta storing operation, and further comprising logic 
level/p recharging means for supplying a first binary 
logic level to one end of the chain and for precharg- 
ing the match line chain to a second binary logic 
level in response to a clock signal. 

14. The CAM of claim 13, wherein the logic level/pre- 
charging means comprises means for tying the one 
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end of the chain directly to the a power supply of 
the first binary logic level. 

15. The CAM of claim 13, wherein the logic levei/pre- 
charging means comprises a first FET at the other s 
end of the chain nearest match sense circuitry. 

16. The CAM of claim 15, wherein the logic level/pre- 
charging means further comprises means tor pre- 
charging the chain to the second binary logic level, io 
commencing following the beginning of the clock 
cycle during which the search is to be performed, 
by the first FET. 

17. The CAM of claim 15, wherein the logic level/pre- is 
charging means further comprises means for pre- 
charging the chain to the second binary logic level, 
commencing following the completion of search op- 
eration, continuing during the non-searching quies- 
cent state of the CAM, and ceasing prior to the ini- 20 
tiation of the next search operation, by the first FET, 
the chain's FETs being in a conducting state during 
precharge, due to the high logic level on their gates. 

18. The CAM of claim 13, wherein the logic level/pre- 25 
charging means comprises the . first FET at one end 
of the chain and a second FET at the other end of 
the chain farthest from match sense circuitry. 

19. The CAM of claim 18, wherein the logic level/pre- 30 
charging means comprises means for connecting 
the one end of the chain to a power supply of the 
first binary logic level, by a third FET which is made 
conducting when the first and second FET are non- 
conducting, the chain's FETs being in a conducting 35 
state during precharge, due to the high logic level 
on their gates. 



20. The CAM of claim 18, wherein the second FET is 
conducting when the first FET is non-conducting. 40 

21. The CAM of claim 13, wherein, 

the logic level/precharging means comprises 
logic level means and precharging means, 45 
the precharging means being for precharging 
means comprises means for precharging the 
chain to the second binary logic level, com- 
mencing following the completion of the search 
operation, continuing during the non-searching so 
quiescent state of the CAM, and ceasing prior 
to the initiation of the next search operation, by 
a first FET at the end of the chain nearest match 
sense circuitry, 

the logic level means being for directly tying the ss 
one end of the chain to a power supply of the 
first binary logic level, 

the core cell in the chain at the extreme one end 



to the match sense circuitry comprising FETs 
identical to those of the other core cells in the 
chain, the compare data being applied at the 
opposite polarity during the quiescent pre- 
charge state, such that the chain's FET is the 
only such FET in the chain which is non-con- 
ducting in the quiescent precharge state. 

22. The CAM of claim 13, wherein, 

the logic level/precharging means comprises 
logic level means and precharging means, 
the precharging means being for precharging 
the chain to the second binary logic level, com- 
mencing following the completion of the search 
operation, continuing during the non-searching 
quiescent state of the CAM, and ceasing prior 
to the initiation of the next search operation, by 
a first FET at the end of the chain nearest match 
sense circuitry, 

the logic level means being for directly tying the 
chain to a power supply of the first binary logic 
level, 

the core cell at a predetermined and constant 
location in the chain comprises FETs identical 
to those of the other core cells in the chain, the 
compare data being applied at the opposite po- 
larity during the quiescent precharge state, 
such that the chain's FET is the only such FET 
in the chain which is non-conducting in the qui- 
escent precharge state, thereby any charge 
sharing occurring during the search of the chain 
tending to make a mismatch appear somewhat 
like a match, but with the degree of the charge 
sharing understood and accounted for. 

23. The CAM of claim 1 , wherein the order of the bits in 
the chains, and among the chains making up each 
word, is intentionally and deterministically scram- 
bled, in order to limit match line, or partial match 
result, transitions and power dissipation. 

24. The CAM of claim 1 3, wherein the core cell com- 
prises: 

first and second N-channel FETs and first and 
second P-channel FETs forming two cross-cou- 
pled inverters for storing differential data; 
third and fourth N-channel FETs coupled with 
the differential data nodes to differential bit 
lines, the FETs being gated by a word line, so 
as to provide access for read and write data op- 
erations; 

a fifth N-channel FET; 

sixth and seventh N-channel FETs, either of 
them being gated by the positive storage node 
and for connecting a positive comparison bit 
line to the gate of the fifth N-channel FET, the 
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other being gated by the negative storage node 
and for connecting a negative comparison bit 
line to the gate of the fifth N-channel FET, the 
comparison bit lines being physically distinct 
from the bit lines employed for read and write s 
access, the source and drain of the fifth N- 
channel FET being connected to those of 
neighbouring cells to form a chain of such FETs. 



25. A dummy match line chain for use in a CAM com- 
prising an array of data store cells coupled with bit 
lines, the dummy match line chain comprising CAM 
cells. 

26. The match line chain of claim 25, wherein the CAM 
cell is a model CAM cell, which is guaranteed to 
match and has its chain FET conducting, the model 
CAM cell comprising: 



two further N-channel FETs, one being gated 
by the inverted positive comparison bit line, and 
connecting the negative comparison bit line to 
the gate of a third further N-channel FET, the 
other being gated by the inverted negative com- 
parison bit line, and connecting the positive 
comparison bit line to the gate of the same third 
further N-channel FET; 

third further N-channel FET, its gate connected 
as described above, with source and drain con- 
nected to like transistors in neighbouring cells 
(or to appropriate other circuits in the boundary 
cases) to form a chain of such devices. 

75 28. The match line chain of claim 25, wherein the CAM 
cell is a model CAM cell, which is guaranteed to 
match and has its chain FET conducting, the model 
cell comprising: 

a pair of differential bit lines, selectively includ- 
ed, but electrically unconnected to the remain- 
der of the core cell; 

a negative comparison bit line, selectively in- 
cluded, but electrically unconnected to the re- 
mainder of the core cell; 
two N-channel FETs, connecting the static dif- 
ferential storage nodes to: the positive supply 
voltage, in the case of the negative static stor- 
age node, and a floating source, in the case of 
the positive static storage node, gated by a 
word line; 

a first P-channel FET, it source connected to 
the positive supply voltage, and its gate and 
drain connected together to the negative static 
storage node, which is kept in a logical high 
state by the arrangement; 
a third N-channel FET, its source connected to 
the negative supply voltage, its gate connected 
to the negative static storage node, said node 
pulled high by the first P-channel FET, resulting 
in its conduction such that its drain, connected 
to the positive static storage node, is kept in a 
logical low state by the arrangement; 
a second P-channel FET, its source connected 
to the positive supply voltage, its gate connect- 
ed to the positive static storage node, said node 
being kept in a logical low state by the third N- 
channel FET as described above, resulting in 
its conduction such that its drain, connected to 
the negative static storage node, is reinforced 
in its logical high state imposed by the first P- 
chanhel transistor, described above; 
a fourth N-channel FET, its source connected 
to the negative supply voltage, its gate connect- 
ed to the positive static storage node, resulting 
in it being in a non-conducting state, and its 
drain electrically unconnected to any other 
node; 



a combination of two N-channel FETs and two 20 
P-channel FETs forming two cross-coupled in- 
verters, implementing static differential data 
storage; 

two additional N-channel FETs, connecting the 
static differential data nodes to differential bit 25 
lines, gated by a word line, for the purpose of 
providing access for read and write operations; 
two further N-channel FETs, one being gated 
by a positive comparison bit line, and connect- 
ing a positive comparison bit line to the gate of 30 
a third further N-channel FET, the other being 
gated by a negative comparison bit line, and 
connecting a negative comparison bit line to the 
gate of the same third further N-channel FET; 
comparison bit lines being physically distinct 35 
from the bit lines employed for read and write 
access; 

third further N-channel FET, its gate connected 
as described above, with source and drain con- 
nected to like transistors in neighbouring cells 40 
., to form a chain of such devices. 

27. The match line chain of claim 25, wherein the CAM 
cell is a model CAM cell, which is guaranteed to 
match when one of the comparison bit lines is in a 45 
logical high state, and has its chain FET conducting 
in that case, the model CAM cell comprising: 

two optional N-channel FETs, with drain and 
source unconnected, gated by a word line, for so 
the purpose of providing model loading to the 
word line; 

a combination of an N-channel FET and a P- 
channel FET forming an inverter, inverting the 
logical state of a positive comparison bit line; ss 
a combination of an N-channel FET and a P- 
channel FET forming an inverter, inverting the 
logical state of a negative comparison bit line; 
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two further N-channel FETs: the first gated by 
the positive static storage node, and connect- 
ing a positive comparison bit line to the gate of 
a third further N-channel FET, said device being 
in a non-conducting state due to the logical low s 
state of its gate; the second gated by the neg- 
ative static storage node, and connecting the 
positive supply voltage to the gate of the same 
third further N-channel FET, said device being 
in a conducting state due to the logical high io 
state of its gate; 

any comparison bit line or lines present being 
physically distinct from any present bit line or 
lines employed for read and write access; 
third further N-channel FET, its gate connected is 
as described above and in a high state, the FET 
being in a conducting state, with source and 
drain connected to like transistors in neighbour- 
ing cells to form a chain of such devices. 

20 

29. The match line chain of claim 28, wherein in the 
model CAM cell, either or both of the (i) positive sup- 
ply voltage connection to the transistor gated by the 
word line and also connected to the negative static 
storage node, and the (ii) positive comparison bit 2B 
line connection to the first further N-channel FET, 
may be replaced with a floating or unconnected 
node. 



each chain has the same number of cells as an 
actual storage chain, but composed of dummy 
core cells, instead of real core cells; 
• each chain always conducts to model a match, 
and responding to compare bit line transitions 
in the same way as a real chain with a match. 

34. The CAM of claim t , further comprising a global da- 
ta bus, connecting peripheral circuitry to the words 
of the CAM, wherein the bus joins all words and is 
used for search, read or write operations, supplying 
bi-directional data, and all drivers onto the bus are 
tri-statable. 

35. A core cell comprises: 

first and second N-channel FETs and first and 
second P-channel FETs forming twocross-cou- 
pled inverters for storing differential data; 
third and fourth N-channel FETs coupled with 
the differential data nodes to differential bit 
lines, the FETs being gated by a word line, so 
as to provide access for read and write data op- 
erations; 

a fifth N-channel FET; 

sixth and seventh N-channel FETs, either of 
them being gated by the positive storage node 
and for connecting a positive comparison bit 
line to the gate of the fifth N-channel FET, the 
other being gated by the negative storage node 
and for connecting a negative comparison bit 
line to the gate of the fifth N-channel FET, the 
comparison bit lines being physically distinct 
from the bit lines employed for read and write 
access, the source and drain of the fifth N- 
channel FET being connected to those of 
neighbouring cells to form a chain of such FETs. 



30. The match line chain of claim 28, wherein in the 30 
model CAM cell, all differential nodes are substitut- 
ed by their corresponding complementary polarities 

of positive and negative for negative and positive, 
respectively. 

35 

31. The match line chain of claim 29, wherein in the 
model CAM cell, all differential nodes are substitut- 
ed by their corresponding complementary polarities 
of positive and negative for negative and positive, 
respectively. 40 

32. The match line chain of claim 1, further comprising 
a dummy chain for determining when precharge of 
all chains is complete, the dummy chains compris- 
ing as many dummy cells as the core cells of an 45 
actual storage chain, the dummy chain always con- 
ducting to model a match, responding to compare 

bit line transitions in the same way as a real chain 
with a match, the dummy chain comprising means 
for detecting when precharge is complete, at its end so 
opposite precharge circuitry. 



33. The CAM of claim 1, further comprising a dummy 
word for determining when sufficient time has 
elapsed for a match to complete, wherein ss 

the word is segmented into the same number 
of chains as an actual word of storage; 
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